277. The actual value may vary according to the particular
application. In FIG. 8, for example, the threshold 249 has
been set at 0.1. Accordingly, a beat has been identified at a
peak location 255. This beat would have been missed if the
value for the threshold 249 had been greater than 0.1.
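
The effect of the threshold on beat identification can be illustrated with a minimal sketch. The function name `find_beat_peaks`, the local-maximum test, and the feature values below are assumptions made for illustration only; they are not taken from FIG. 8 or from the detector described in this specification.

```python
def find_beat_peaks(feature, threshold):
    """Return indices of local maxima in `feature` that exceed `threshold`.

    `feature` is a hypothetical normalized detection curve (for example,
    a variance or envelope sequence); the source does not define this API.
    """
    peaks = []
    for i in range(1, len(feature) - 1):
        # A sample is a candidate peak if it rises above its left neighbor
        # and is not lower than its right neighbor.
        is_local_max = feature[i - 1] < feature[i] >= feature[i + 1]
        if is_local_max and feature[i] > threshold:
            peaks.append(i)
    return peaks


# Hypothetical feature values, chosen so that one peak lies just above 0.1.
feature = [0.02, 0.05, 0.30, 0.04, 0.03, 0.12, 0.06, 0.02, 0.45, 0.05]

peaks_low_threshold = find_beat_peaks(feature, threshold=0.1)
peaks_high_threshold = find_beat_peaks(feature, threshold=0.2)
```

With the lower threshold the weak peak at index 5 is retained as a beat candidate; with the higher threshold it is dropped, which mirrors the missed beat described above.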

When errors occur in audio transmittal applications using 
the Global System for Mobile Communications (GSM) 
protocol, the errors normally occur at random. Occasional 
losses of single or double packets are more likely to occur 
in Internet applications, where each packet has a duration of 
about 20 msec, to give a packet-loss error of about 40 msec 
in duration. Using this model, the capacity requirement of 
the circular FIFO buffer 50 can be reduced. When the 
reduced memory capacity is used, fewer audio data frames 
need to be stored in the circular FIFO buffer 50. 

In an alternative embodiment, the memory storage capac- 
ity of the circular FIFO buffer 50 can be reduced by storing 
only selected audio frames rather than every audio frame in 
the incoming stream. In a first example, shown in FIG. 12, 
two audio frames 301 and 303 at strong beat 1 are stored in 
the circular FIFO 50. Additionally, two audio frames 305 
and 307 at offbeat 2 are stored, two audio frames 309 and 
311 at strong beat 3 are stored, and two audio frames 313 and 
315 at offbeat 4 are stored in the circular FIFO 50. Note that 
none of the audio frames occurring between audio frames 
303 and 305, between audio frames 307 and 309, and 
between audio frames 311 and 313 are stored. Accordingly, 
when a defective audio frame 323 (frame 0) is identified, the 
defective frame 323 can be replaced by audio frame 301 
since the defective audio frame 323 occurs at a beat 327. In 
a conventional error concealment method, the defective 
audio frame 323 could be replaced by either a previous audio 
frame 321 (frame-1) or by a subsequent audio frame 325
(frame+1). 
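
The selective storage of FIG. 12 can be sketched as follows. This is an illustrative sketch only: the class `BeatFrameStore`, the function `conceal`, the use of the beat position within the bar as a lookup key, and the assumption that beat 327 corresponds to beat position 1 are all hypothetical and are not stated in this form in the specification.

```python
from collections import deque


class BeatFrameStore:
    """Circular FIFO that keeps only the frames located at detected beats."""

    def __init__(self, capacity):
        # A deque with maxlen discards its oldest entry when full,
        # which gives circular FIFO behavior.
        self._slots = deque(maxlen=capacity)

    def offer(self, beat_position, frames):
        """Store the frames at a beat; frames between beats are never offered."""
        self._slots.append((beat_position, list(frames)))

    def latest(self, beat_position):
        """Return the most recently stored frames for a beat position, or None."""
        for position, frames in reversed(self._slots):
            if position == beat_position:
                return frames
        return None


def conceal(on_beat, beat_position, store, previous_frame):
    """Use a stored beat frame if the defect is on a beat, else repeat the previous frame."""
    if on_beat:
        stored = store.latest(beat_position)
        if stored:
            return stored[0]
    return previous_frame


# Frame labels follow the reference numerals of FIG. 12.
store = BeatFrameStore(capacity=4)
store.offer(1, ["frame301", "frame303"])  # strong beat 1
store.offer(2, ["frame305", "frame307"])  # offbeat 2
store.offer(3, ["frame309", "frame311"])  # strong beat 3
store.offer(4, ["frame313", "frame315"])  # offbeat 4

# Defective frame 323 is assumed to fall on beat position 1 of the next bar.
replacement = conceal(True, 1, store, previous_frame="frame321")
```

Keying the store by beat position keeps the lookup independent of how many frames lie between beats, since those intermediate frames are never stored.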

The group of audio frames denoted by n includes four
audio frames, of which the audio frame 323 (frame 0)
is the audio frame currently being sent to the listener,
for example via a loudspeaker. The previously received
audio frame is audio frame 321 (frame-1), and the next
frame after the audio frame 323 is the audio frame 325
(frame+1). The audio frame 325 is the next available audio
frame to be decoded.

In another embodiment, shown in FIG. 13, only two audio 
frames 331 and 333 at strong beat 1 and two audio frames 
335 and 337 at offbeat 2 have been stored, so as to place a 
smaller demand on the memory storage capacity of the 
circular FIFO 50. The next-arriving audio frame 345 
(frame+1) is interpolated with the previous audio frame 341 
to produce replacement data for a corrupted audio frame 343 
(frame 0). In the embodiment of FIG. 14, four audio frames 
351 (frame 0), 353 (frame+1), 355 (frame+2), and 357 
(frame+3) have been lost. Since this loss occurred at a beat 
location, the audio frames are replaced by previously-stored 
audio frames 361 and 363 occurring at strong beat 1. The 
audio frame 351 can be replaced by a previous audio frame 
365 (frame-1), and the audio frame 357 can be replaced by
the next audio frame 367 (frame+4) in the audio stream. 
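
The two replacement strategies of FIG. 13 and FIG. 14 can be sketched as follows. The function names and the choice of a sample-wise average as the interpolation are assumptions; the specification does not state which interpolation is used, and frames are represented here simply as lists of sample values.

```python
def interpolate_frames(previous, following):
    """Sample-wise average of two equal-length frames (one possible interpolation)."""
    if len(previous) != len(following):
        raise ValueError("frames must have the same length")
    return [(a + b) / 2 for a, b in zip(previous, following)]


def conceal_burst(lost_count, previous, following, stored_beat_frames):
    """Build replacement frames for a burst of `lost_count` lost frames.

    A single lost frame is interpolated from its neighbors (FIG. 13).
    For a longer burst (FIG. 14), the first lost frame repeats `previous`,
    the last repeats `following`, and the frames in between are taken,
    in order, from the frames previously stored at a beat.
    """
    if lost_count < 1:
        raise ValueError("lost_count must be at least 1")
    if lost_count == 1:
        return [interpolate_frames(previous, following)]
    middle = list(stored_beat_frames[: lost_count - 2])
    if len(middle) != lost_count - 2:
        raise ValueError("not enough stored beat frames for this burst")
    return [previous] + middle + [following]


# Hypothetical two-sample frames standing in for frames 365, 367, 361, and 363.
frame_365 = [0.0, 2.0]
frame_367 = [2.0, 4.0]
frame_361 = [1.0, 1.0]
frame_363 = [3.0, 3.0]

single = conceal_burst(1, frame_365, frame_367, [])
burst = conceal_burst(4, frame_365, frame_367, [frame_361, frame_363])
```

In the four-frame case the result lines up with FIG. 14: frame 351 takes frame 365, frames 353 and 355 take the stored frames 361 and 363, and frame 357 takes frame 367.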

FIG. 15 presents as a block diagram the structure of a 
mobile phone 400, also known as a mobile station, accord- 
ing to the invention, in which a receiver section 401 includes 
a beat detector control block 405 included in an audio 
decoder 403. A received audio signal is obtained from a 
memory 407 where the audio signal has been stored digi- 
tally. Alternatively, audio data may be obtained from a 
microphone 409 and sampled via an A/D converter 411. The 
audio data is encoded in an audio encoder 413 after which 
the processing of the base frequency signal is performed in 
block 415. The channel coded signal is converted to radio
frequency and transmitted from a transmitter 417 through a
duplex filter 419 (DPLX) and an antenna 421 (ANT). At the
receiver section 401, the audio data is subjected to the
decoding functions including beat detection, according to
any of the teachings of the alternative embodiments
explained above. The recorded audio data is directed
through a D/A converter 423 to a loudspeaker 425 for
reproduction.

FIG. 16 presents an audio information transfer and audio
download and/or streaming system 450 according to the
invention, which system comprises mobile phones 451 and
453, a base transceiver station 455 (BTS), a base station
controller 457 (BSC), a mobile switching center 459 (MSC),
telecommunication networks 461 and 463, and user terminals
465 and 467, interconnected either directly or over a
terminal device, such as a computer 469. In addition, there
may be provided a server unit 471 which includes a central
processing unit, memory, and a database 473, as well as a
connection to a telecommunication network, such as the
Internet, an ISDN network, or any other telecommunication
network that is in connection either directly or indirectly to
the network to which the terminal having the decoder,
including the beat detector of the invention, is capable of
being connected either wirelessly or via a wired line
connection. In the audio data transfer system according to
the invention, the mobile stations and the server are
point-to-point connected, and the user of the terminal 451
has a terminal including the beat detector in the decoder of
its receiver, as shown in FIG. 15. The user of the terminal 451
selects audio data, such as a short interval of music or a short
video with audio music, for downloading to the terminal. In
the select request from the user, the terminal address is
known to the server 471, as is the detailed information of the
requested audio data (or multimedia data), in such detail that
the requested information can be downloaded. The server
471 then downloads the requested information to the other
connection end or, if connectionless protocols are used
between the terminal 451 and the server 471, the requested
information is transferred over a connectionless link in
such a way that the recipient identification of the
terminal is attached to the sent information. When the
terminal 451 receives the requested audio data, the data can
be streamed and played through the loudspeaker of the
receiving terminal, in which the error concealment is
achieved by applying the beat detection in accordance with
one embodiment of the invention.

The above is a description of the realization of the
invention and its embodiments utilizing examples. It should
be self-evident to a person skilled in the relevant art that the
invention is not limited to the details of the above presented
examples, and that the invention can also be realized in other
embodiments without deviating from the characteristics of
the invention. Thus, the possibilities to realize and use the
invention are limited only by the claims, and by the
equivalent embodiments which are included in the scope of
the invention.

What is claimed is: 

1. A method for concealing errors detected in an input
digital audio bit stream, the audio bit stream configured as
a series of frames, said method comprising the steps of:

detecting a first beat and a subsequent plurality of beats in
the audio bit stream;

defining a first inter-beat interval extending between said
first beat and a (k+1)th subsequent beat;

storing at least a portion of the audio bit stream occurring
within said first inter-beat interval;



US 7,069,208 B2 






detecting an erroneous audio segment occurring in a
second inter-beat interval extending between said
(k+1)th beat and a (2k+1)th subsequent beat; and

replacing at least a first part of said erroneous audio
segment with a corresponding part of said stored audio
bit stream portion, wherein the corresponding part is
selected based on a time relationship between the first
part and one of the (k+1)th and (2k+1)th beats.

2. A method as in claim 1 wherein k is an integer greater
than or equal to 2. 

3. A method as in claim 1 wherein said stored audio bit 
stream portion includes at least one frame positioned on at 
least one of said beats. 

4. A method as in claim 1 wherein said step of detecting 
a first beat comprises a step of computing the variance of the 
audio bit stream using decoded IMDCT coefficients. 

5. A method as in claim 1 wherein said step of detecting 
a first beat comprises a step of utilizing a window-switching 
pattern. 

6. A method as in claim 1 wherein said step of detecting 
a first beat comprises a step of computing the envelope of the 
audio bit stream using decoded IMDCT coefficients. 

7. A method as in claim 1 wherein said step of detecting 
a first beat comprises steps of computing the variance of the 
audio bit stream using decoded IMDCT coefficients and 
utilizing a window-switching pattern. 

8. A method as in claim 1 wherein said step of storing at 
least a portion of the audio bit stream includes a step of 
storing said portion in a circular first-in first-out (FIFO) 
buffer. 

9. A method as in claim 1 wherein the audio bit stream
includes a music signal.

10. A method as in claim 1 wherein the erroneous audio
segment is the result of at least one of a packet loss from an
IP network and a burst error from a wireless channel.

11. A method as in claim 1 further comprising the step of
replacing one beat with another beat from a preceding bar.

12. A method as in claim 1, wherein the first part has a
time displacement τ from one of the (k+1)th and (2k+1)th
beats, and wherein the corresponding part is selected so as
to have the same time displacement τ from one of the first
and (k+1)th beats.

13. A method as in claim 1, further comprising:

determining a confidence score, the confidence score
being a percentage of correct beat detection within an
observation window; and

discontinuing said replacing step when the confidence
score is below a threshold value.

14. A method as in claim 1, further comprising estimating
an inter-beat interval according to the formula

$$IBI_i = IBI_{i-1} \cdot (1 - \alpha) + IBI_{new} \cdot \alpha,$$

wherein $IBI_i$ is a current estimation of the inter-beat interval,
$IBI_{i-1}$ is a previous estimation of the inter-beat interval,
$IBI_{new}$ is a most recently-detected inter-beat interval, and $\alpha$
is a weighting parameter.

15. A method as in claim 1, wherein said storing comprises
minimizing storage requirements by only storing
frames adjacent to a strong beat or to an offbeat.

16. A method as in claim 1, further comprising replacing
a corrupted audio frame by interpolating preceding and 
succeeding audio frames. 

17. A method as in claim 1, further comprising replacing
a second part of the erroneous audio segment preceding the 
first part of the erroneous audio segment with a frame 
preceding the second part. 
18. A method as in claim 1, further comprising replacing
a second part of the erroneous audio segment following the 
first part of the erroneous audio segment with a frame 
following the second part. 

19. A method as in claim 1, further comprising:

replacing a second part of the erroneous audio segment 
preceding the first part of the erroneous audio segment 
with a frame preceding the second part; and 
replacing a third part of the erroneous audio segment 
following the first part of the erroneous audio segment 
with a frame following the third part. 

20. A method as in claim 5, wherein said detecting a first
beat and a subsequent plurality of beats further comprises:

detecting strong beats and off-beats, and

determining an interval between strong beats based on a
statistical probability of inter-beat intervals.

21. A method as in claim 20, wherein said detecting a first
beat and a subsequent plurality of beats further comprises:

determining the interval between strong beats based on a
most probable inter-beat interval of approximately 600
ms.

22. A wireless terminal comprising:

a receiver section having a beat detector and an audio
decoder, wherein the receiver section is configured to
perform steps comprising

detecting a first beat and a subsequent plurality of beats
in an audio bit stream,

defining a first inter-beat interval extending between
said first beat and a (k+1)th subsequent beat,

storing at least a portion of the audio bit stream
occurring within said first inter-beat interval,

detecting an erroneous audio segment occurring in a
second inter-beat interval extending between said
(k+1)th beat and a (2k+1)th subsequent beat, and

replacing at least a first part of said erroneous audio
segment with a corresponding part of said stored
audio bit stream portion, wherein the corresponding
part is selected based on a time relationship between
the first part and one of the (k+1)th and (2k+1)th
beats.

23. The wireless terminal of claim 22, wherein k is an
integer greater than or equal to 2.

24. The wireless terminal of claim 22, wherein said stored
audio bit stream portion includes at least one frame
positioned on at least one of said beats.

25. The wireless terminal of claim 22, wherein said step
of detecting a first beat comprises a step of computing the
variance of the audio bit stream using decoded IMDCT
coefficients.

26. The wireless terminal of claim 22, wherein said step
of detecting a first beat comprises the step of utilizing a
window-switching pattern.

27. The wireless terminal of claim 22, wherein said step
of detecting a first beat comprises a step of computing the
envelope of the audio bit stream using decoded IMDCT
coefficients.

28. The wireless terminal of claim 22, wherein said step
of detecting a first beat comprises steps of computing the
variance of the audio bit stream using decoded IMDCT
coefficients and utilizing a window-switching pattern.

29. The wireless terminal of claim 22, wherein said step
of storing at least a portion of the audio bit stream includes
a step of storing said portion in a circular first-in first-out
(FIFO) buffer.

30. The wireless terminal of claim 22, wherein the audio
bit stream includes a music signal.

31. The wireless terminal of claim 22, wherein the
erroneous audio segment is the result of at least one of a frame
loss from an IP network and a burst error from a wireless
channel.

32. The wireless terminal of claim 22, wherein the first
part has a time displacement τ from one of the (k+1)th and
(2k+1)th beats, and wherein the corresponding part is
selected so as to have the same time displacement τ from one
of the first and (k+1)th beats.

33. The wireless terminal of claim 22, wherein the
receiver section is configured to perform steps comprising:

determining a confidence score, the confidence score
being a percentage of correct beat detection within an
observation window, and

discontinuing said replacing step when the confidence
score is below a threshold value.

34. The wireless terminal of claim 22, wherein the
receiver section is configured to perform steps comprising:

estimating an inter-beat interval according to the formula

$$IBI_i = IBI_{i-1} \cdot (1 - \alpha) + IBI_{new} \cdot \alpha,$$

wherein $IBI_i$ is a current estimation of the inter-beat interval,
$IBI_{i-1}$ is a previous estimation of the inter-beat interval,
$IBI_{new}$ is a most recently-detected inter-beat interval, and $\alpha$
is a weighting parameter.

35. The wireless terminal of claim 22, wherein said
storing comprises minimizing storage requirements by only
storing frames adjacent to a strong beat or to an offbeat.

36. The wireless terminal of claim 22, wherein the
receiver section is configured to perform steps comprising:

replacing a corrupted audio frame by interpolating
preceding and succeeding audio frames.

37. The wireless terminal of claim 22, wherein the
receiver section is configured to perform steps comprising:

replacing a second part of the erroneous audio segment
preceding the first part of the erroneous audio segment
with a frame preceding the second part.

38. The wireless terminal of claim 22, wherein the
receiver section is configured to perform steps comprising:

replacing a second part of the erroneous audio segment
following the first part of the erroneous audio segment
with a frame following the second part.

39. The wireless terminal of claim 22, wherein the
receiver section is configured to perform steps comprising:

replacing a second part of the erroneous audio segment
preceding the first part of the erroneous audio segment
with a frame preceding the second part, and

replacing a third part of the erroneous audio segment
following the first part of the erroneous audio segment
with a frame following the third part.

40. The wireless terminal of claim 26, wherein said
detecting a first beat and a subsequent plurality of beats
further comprises:

detecting strong beats and off-beats, and

determining an interval between strong beats based on a
statistical probability of inter-beat intervals.

41. The wireless terminal of claim 40, wherein said
detecting a first beat and a subsequent plurality of beats
further comprises:

determining the interval between strong beats based on a
most probable inter-beat interval of approximately 600
ms.

42. The wireless terminal of claim 22, wherein the
receiver section is configured to perform the step of
replacing one beat with another beat from a preceding bar.
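
The inter-beat interval estimation and the confidence score recited in the claims can be illustrated with a short sketch. It assumes the weighted-update reading of the formula (the current estimate is the previous estimate weighted by one minus the weighting parameter, plus the newest interval weighted by that parameter), a weighting parameter between 0 and 1, and a caller-supplied list of flags marking which detections in the observation window were correct. The function names are hypothetical, and how a detection is judged correct is not defined here.

```python
def update_ibi(previous_ibi, new_ibi, alpha):
    """Weighted update: IBI_i = IBI_{i-1} * (1 - alpha) + IBI_new * alpha."""
    # Restricting alpha to [0, 1] is an assumption, not a stated requirement.
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha is assumed to lie in [0, 1]")
    return previous_ibi * (1.0 - alpha) + new_ibi * alpha


def confidence_score(correct_flags):
    """Percentage of correct beat detections within an observation window."""
    if not correct_flags:
        return 0.0
    return 100.0 * sum(1 for flag in correct_flags if flag) / len(correct_flags)


def keep_beat_replacement(correct_flags, threshold_percent):
    """Return False when replacement should be discontinued (score below threshold)."""
    return confidence_score(correct_flags) >= threshold_percent


# Hypothetical values in milliseconds; 600 ms is the most probable interval
# mentioned in the claims, the other numbers are invented for illustration.
estimate = update_ibi(previous_ibi=600.0, new_ibi=640.0, alpha=0.25)

window = [True, True, False, True]
score = confidence_score(window)
continue_replacing = keep_beat_replacement(window, threshold_percent=80.0)
```

A small weighting parameter makes the estimate follow tempo changes slowly, while a large one tracks the newest interval more closely at the cost of sensitivity to a single mis-detected beat.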